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TRAINING FOR LONG DURATION SPACE MISSIONS 

Joseph H. Goldberg 
Assistant Professor 

Department of Industrial & Management Systems Engineering 
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The successful completion of an extended duration manned mission to Mars will require renewed 
research effort in the areas of crew training and skill retention techniques. The current estimate 
of in-flight transit time is about nine months each way, with a six month surface visit, an order of 
magnitude beyond previous U.S. space missions. Concerns arise when considering the level of skill 
retention required for highly critical , one-time operations such as an emergency procedure or a 
Mars orbit injection. 

The objectives of this research project were to review the factors responsible for the level of 
complex skill retention, to sug^t optimal ways of refreshing degraded skills, and to outline a 
conceptual crew training design for a Mars mission. 

Currently proposed crew activities during a Mars mission were reviewed to identify the spectrum 
of skills which must be retained over a long time period. Skill retention literature was reviewed, 
to identify those factors which must be considered in deciding when and which tasks need 
retraining. Task, training, and retention interval factors were identified. These factors were then 
interpreted in light of the current state of spaceflight and adaptive training systems. Finally, the 
retention factors formed the basis for a conceptual design of Mars mission training requirements. 
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TRAINING FOR LONG DURATION SPACE MISSIONS 

It is human nature to for^t highly learned information. Over time, psychomotor skills that may 
have been over learned degrade into awkward movements at a later time, while orctered sequences 
and events rapidly become disorctered. The study of human skill retention and degradation has been 
ongoing for many decades; useful information exists, but a comprehensive model of skill retention 
as a function of independent task and individual factors must still be developed. 

This paper considers these skill retention factors in light of a long-duration spaceflight, such as a 
manned mission to Mars. Retention of finely tuned skills and knowledge is absolutely necessary 
for the successful completion of such a mission, yet man-machine system complexities are 
becomming more and more complex. These skill retention issues also have implications that go far 
beyond manned spaceflight. Industry must train workers, often for long periods of time and with 
concomitant losses in productivity. Colleges and universities are also in the business of training 
individuals with skills and knowledge for long-term retention. Clearly, accurate long-term 
retention of skills and knowlec^ is important for productivity and safety within the entire 
society. 

This paper is divided into three major sections in its examination of long duration skill retention 
for manned spaceflight. ( 1 ) Currently proposed crew activities in a manned mission to Mars are 
reviewed. This information gave a concrete focus to the skill retention issues described here, and 
allowed bounds to be placed upon the duration and nature of training and retention. (2) Recent 
psychological and Human Factors literature (within the past 30 years) on factors influencing 
long-term retention was reviewed, concentrating on results and conclusions, rather than a 
critique of methodology. The purpose of this section was to provide a framework by which 
retention tactors could be studied, and to subsequently provide the necessary framework for a 
future model of skill retention time and quality. ( 3) A section on training for space missions was 
included. The intent of this section was to provide a foundation from which longer duration 
retention training could build, and to outline the required training elements for an advanced, long 
duration Mars mission. While much of such a conceptual design is merely an exercise in futurist 
guesswork , an attempt was made to logically build on the concepts presented in earlier sections of 
this report. A description of the research yet needed to develop a working skill retention model 
was also included. Such a model would greatly aid the conceptual design of Mars and other long 
duration missions, as well as industrial job design. Information for Sections 1 and 3, where not 
otherwise noted, was obtained from personal communication and experience at the Johnson Space 
Center , Houston, Texas. The author wishes to acknowledge Jack James, Andy Retro, and John Aired 
for their valuable assistance. 


1. MARS MISSION ACTIVITIES 

A current scenario for an interplanetary mission to Mars includes several distinct phases; 

1 . Earth lift-off to low Earth orbit ( possible at space station) 

2. Taxi-transfer from space station to orbiting interplanetary vehicle 

3. Transit to Mars 

4. Docking with second orbital transfer vehicle at Mars 
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5. Land on surface 

6. Reverse sequence for return mission 

The time involved for such a mission is on the order of 5 to 9 months each way. depending on 
orbits, trajectories, etc. , with a 6 month or longer surface visit. Using current and near-term 
technology, the transit times should not appreciably differ from this estimate, but the time spent 
on the surface could dramatically change with the addition of a permanent manned Martian base. 


1.1. CREW ACTIVITIES 

The activities to be carried out in a Mars mission are as varied as those in everyday life. They 
may be broken down into four areas, as shown below. Sources of information on crew activities 
and events include Oberg ( 1982),0berg andOberg ( 1986), Joels ( 1 985), Conners etal. 

( 1 985), and National Commision on Space ( 1 986). 

Spacecraft Control and Maintenance . These critical activities will insure that mission success and 
crew safety will not be comprimised. Crew members, armed with automated equipment and 
extensive computer programs, must serve as diagnosticians, continuously asking new questions 
about the status of equipment. As discussed by others, maintenance will range from simple 
modular replacement of Lithium cannisters, to large-scale reconstruction or evacuation of space 
vehicle apparatus. Both dexterity and decision ability will be required. 

Scientific Study. Much investigation will continue to be performed in space. Many accounts have 
indicated an even broader range of research topics than in previous space missions. These will 
include not only physical and hard science topics, but will be expanded to social and behavioral 
science issues including space habitability, behavioral interaction, and group power structures. 

Crew Health Maintenance. The health of the crew will include both regular physiological and 
psychological screening. Many innovative diagnostic and treatment procedures will be developed 
for long duration space missions, based on prior space station experience. Intelligent computer 
programs, in the form of expert systems, will likely be extensively used as guides for diagnosing 
and studying new forms of illnesses. 

Recreation. Many have stressed the importance of recreation in an interplanetary mission ( e.g. , 
Fraser, 1 968). Alternative cognitive and physical activites during off-duty time will be 
important to maintaining a healthy crew. 


1 2. FACTORS IN A LONG DURATION MISSION 

Skill training and retention requirements for a Mars mission will necessarily differ from that 
required by all previous missions, as determined by criticality and duration of events. Those 
skills that quickly degrade must be refreshed often or continuously, while better retained skills 
need only be refreshed periodically. A training program for this mission must consider several 
factors that are unique to a manned interplanetary journey, as listed below. 
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Skill Retention Duration. The required skill retention interval , between training and actual 
performance, may be 6 months or more. This is an order of magnitude greater than previous U.S. 
missions, and presents many unknowns for complex skill and procedures retention. Special 
on-board refresher training will be required for spme of these degraded skills. 

Crew Autonomy . As one-way communication lags of 1 0 to 30 minutes will be encountered near 
Mars, Earth-based mission control will be of little use. Instead, ground-control will serve as an 
independent opinion source and coach for an autonomous crew. The crew of 6 to 8 will function as 
a team , with each member contributing complementary expertise. Crew training thus must focus 
on enhancing those traits that increase this autonomy, and counter the negative effects of group 
thinking. 

Crew Confinement . The adverse effects of long-term confinement must be well understood before 
undertaking this mission. Training for long-term confinement must be considered, and techniques 
of countering confinement, such as projecting video landscapes, may be necessary. Study of analog 
confinement environments, such as prison or arctic stations, will aid in this definition. . 

Criticality of .Skills. Some required skills, such as orbital docking, will have a criticality beyond 
all other skills. Many of these will be performed only once or twice in amission, after a long 
no-practice duration. The effects of real and perceived skill criticality on performance and 
training must be understood before undertaking a Mars mission. 

Automation . Extensive use of artificial intelligence and automated sensing and diagnosing 
apparatus will be used for routine spacecraft control and maintenance. The crew will be 
responsible for monitoring this equipment, and factors determining crew monitoring or vigilance 
performance must be understood. A useful human- machine allocation model must be developed, 
and training for this will be required. NASA has already taken a first step in defining this model 
(von Tiesenhausen, 1982). 

Workload. The effects of mental and physical workload must be modeled before initiating a long 
mission, to allow a constant performance level within an autonomous crew. The choice of how 
many crew members to allocate to tasks should be determined via a generic workloal mocteling 
computer program. 

Environment . The adverse effects of vibration, noise, radiation, ion concentrations, and carbon 
dioxide are among the many environmental factors whose effects will be felt over the entire 
mission. The effects of these factors on health and skill retention must be considered in the design 
of the Mars mission. 

These important factors must all be considered when designing a training program for a long 
duration mission. While shorter duration mission crews have tolerated and even performed well 
under some of these factors, their effects will be exacerbated by long-term confinement. Since a 
Mars mission is an order of magnitude bevond current missions in duration and complexity . its 
training program cannot be evolutionarily develnned Instead, a rethinking of training is 
required; a model specifying training needs by type of skill and degradation level must be 
developed. The purpose of this paper is to take an initial step towards such a model , by indicating 
those factors that affect skill retention, and thus training requirements. 
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2. SKILL RETENTION 

The duration and quality of skill retention should necessarily determine the training requirements 
of a long duration space mission. Skills that quickly (tegrade must often be refreshed, whereas 
better retained skills may be neglected for a longer time. Before consictering this literature, 
several qualifications must be made, however. ( 1 ) Reports of studies in this area are often not 
readily obtainable. This may be due to the fact that much of the training research has been 
conducted in the private and military sectors, which have little impetus to publish in widely 
distributed publications. Also, much of this research is very task specific, and investigators may 
have have felt that their research would have low utility outside their immediate scope. ( 2) The 
major retention factors are covered below as discrete topics, but all are intimately Intertwined 
and confounded. Differences in the length of a post- training retention interval , for example, are 
confounded with the type and duration of initial training. Conclusions drawn here must clearly be 
interpreted with a great deal of caution. To gain a better understanding of these factors, however, 
they are discussed separately, ignoring conjoint and interactive effects. ( 3) In some cases, 
conclusions were necessarily drawn from very few studies, clearly scientifically inappropriate. 
This was pragmatically done to at least provide a direction for future research needs and 
developments. 

Naylor and Bri^ (1961) reviewed over 60 years of literature, and created the first 
categorization of retention- influencing variables. ( 1 ) Task variables included the 
procedural/tracking task dichotomy introduced below. They raised the important issue that the 
difficulty and organization of a task is likely responsible for observed retention differences. ( 2) 
training variables inducted three subclasses of factors: the amount of initial training, distribution 
of training sessions, and transfer effects from other tasks. ( 3) Retention interval variables 
included those factors present within this period. ( 4) Recall variables consisted of other 
retention- influencing factors, such as the training fidelity, or the presence of any warmup 
activity prior to retention testing. The present review drew heavily on this work, and extended 
their factor categories. A subsequent review (Gardl in and Sitter ley, 1 972) covered many skill 
retention studies, under contract to NASA. These investigators provided annotated reviews of many 
studies that were directly applicable to the piloting of space vehicles. The present review also 
drew on this paper, but was broater in coverage. 

Selected skill retention studies cited below are summarized in Table 1 , which presents the 
following information: ( 1 ) Investioatorfs). (2) Retention: time interval between end of training 
and initial retest, ( 3) Task: type of performance task. P: procedural (discrete), T: tracking and 
control (continuous), (4) Indp. Var.: independent or manipulated variable(s); D: duration of 
training, R: retention interval, S: structure of training, F: fidelity of training, 0: organization of 
task , RR: retention interpolated activity or rehearsal , ( 5) Task Description . ( 6) Cmolxtv: 
complexity of the task( s) , subjectively estimated by the number and type of simultanous 
activities that had to be performed, ( 7) Training: method of training; duration or criterion, ( 8) 
^Ss: number of subjects tested across entire study, ( 9) Exoer: subjective subject experience at 
task ; all subjects were inexperienced in abstract tracking tasks, whereas some aircraft control 
studies utilized experienced pilots; I: inexperienced, E: experienced, ( 1 0) Dependent Var. : 
dependent or measured performance variable(s) 
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The results of these studies, referred to throughout this report, are summarized in F igures 1,2, 
and 3. Manipulations of retention interval , training duration, and training organization are 
shown, and every attempt was made to combine similar studies with identical dependent 
parameters into one figure. Figure 1 shows performance in procedural tasks, or those requiring 
cognitive control or sequencing over many procedural steps. Note that three time scales, to allow 
sufficient resolution, were used on the retention time axes; 0-24 months, 0-6 months, and 0-4 
weeks. Depenctent variables here included both errors and time to complete procedures. F igure 2 
shows performance in simple tracking tasks over the same retention intervals as the procedural 
tasks. Dependent measures here included integrated error in volts, inches, or arbitrary numbers, 
measuring the deviation between a target and one's ability to follow it. Other measures incluctej 
the acqusition time to capture a target, or the percentage of total time on a target. All of these 
parameters generally required some form of continuous sampling by the experimental apparatus. 
Figure 3 also shows continuous tracking, but only for studies which presented much more complex 
flight control tasks. These experiments often used open- or closed- loop simulators of airplanes or 
space vehicles. All retention interval axes here were 0-6 months. Dependent measures usually 
consisted of a large collection of parameters, of which a subset was chosen, such as altitude error 
from a preset flight path. 
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Table 1 (cont). Skill Retention Studies 
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Figure 3. Complex Flight Control Skill as a Function of Retention Time Interval 
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2.1. TASK FACTORS 

Those factors affecting skill retention that are direct properties of a task are considered in this 
section. Clear trends that appear in these factors are valuable in the design and evaluation of task 
training, as those factors responsible for a large amount of skill degradation are good candidates 
for potential elimination or control. Presence of a large number of these critical factors can point 
to tasks requiring frequent refresher training. Great care must be used when evaluating tasks 
containing more than one of these factors, as interacting factors have rarely been investigated in a 
controlled manner. Anything other than qualitative comparisons across studies are dangerous, due 
to countless numbers of uncontrolled factors. Rather , generalizations should be drawn by first 
noting within-study conclusions, then qualitatively comparing these across many studies. 

2.1.1. Type of Task 

Since some of the earliest skill retention research, a distinction has been made between overall 
types of tasks. Procedural tasks are those requiring discrete, ordered responses. Some have 
labelled these as cognitive tasks, referring to the large amount of non-automatic cognitive 
resource required when trying to recall a long sequence of task steps, while others have called 
them discrete tasks, for the isolated responses that are required. Examples here include 
checklists on aircraft or space vehicles, emergency procedures, and more abstract tasks such as 
setting sequences of switches in a proper order. The other class of tasks are those which require 
psychomotor skills for successful completion. Some have also labelled these as continuous control 
or tracking tasks, bKause discrete responses are not given. Typically, these require an individual 
to keep some stimulus on target, or within a specified range of conditions. Common examples here 
are driving a car , controlling an airplane or space vehicle, or simply manually controlling a lever 
so that a displayed shape remains between two points on an oscilloscope. Most real-world tasks 
contain an element of both of these. Early research efforts simplified these as much as possible to 
obtain a high degree of experimental control. These studies indicated that procedural skills 
degrade more quickly than operational or continuous skills. Only those studies which presented 
both types of tasks to subjects, measuring relative skill degradation differences, are appropriate 
in the present assessment. 

In early studies, continuous tasks were simple tracking movements while procedural tasks 
consisted of sequence memorization. Ammons et al. ( 1 958) tested over 1 000 subjects on either a 
17-step procedural task or a model airplane control task, over many retention intervals. Greater 
skill loss occurred on the procedural , cognitive task than on the motor control task over retention 
intervals of up to two years. Naylor et al. ( 1 962) combined a procedural switch.setting task ( 9 
sets of 3 switches) with a three dimensional joystick-controlled tracking task and found similar 
skill retention for both tasks. In this instance, however, Gardlin and Sitterley ( 1 972) noted that 
the procedural task was very simple, and greater skill d^radation relative to the tracking task 
might have otherwise been expected. Using only a tracking task , Trumbo et al. ( 1 965a) broke 
down overall performcance into both temporal and spatial accuracy dependent measures. The 
former measured anticipations and lag time, whereas the latter measured absolute positioning 
accuracy. Interestingly, the temporal skill performance was lost more quickly than the spatial 
performance over the one week to five month retention intervals. The better subjects may have 
emphasized the temporal aspects of the task more than the spatial aspects, suggesting that more 
effort should be spent on maintaining temporal task performance. 
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Later studies have used more complex, flight control tracking tasks with prxedural checklists, 
measuring performance on both. Mengelkoch et al. ( 1 97 1 ) combined a flight control task with a 
procedural checklist task containing 125 discrete items. Over a four month retention interval, 
discrete procedural responses were more susceptible to forgetting than the continuous flight 
control responses. Though the procedural losses were great enough to be practically significant, 
the investigators were careful to qualify this conclusion. It is not possible to define equal levels of 
learning between the two types of dependent measures; tracking is measured as a continuous 
variable (e.g., altitude error), while procedural lists are measured by percentage error. In this 
context of real aviation training, however, more emphasis should logically be placed on the 
learning and retention of procedural checklists than on continuous flight control ( Gardlin and 
Sitterley, 1972). 

Sitterley and others conducted a series of skill retention studies for NASA in the early 1 970‘s. All 
tasks required both a:tive flight control and the use of procedural checklists, and utilized a 
complex , closed- Irop space shuttle cockpit simulator. Comparing manual control with emergency 
procedural skill retention from one to six months, Sitterley and Berge ( 1 972) found that 
procedural skills ctegraded much more rapidly than operational skills. Flight control skills were 
acceptably retained for two months, whereas procedural performance degraded after only one 
month; flight performance degraded by a factor of ten after an interval of four months. Sitterley 
et al. ( 1 972) found similar patterns of degradation, but the procedural skill loss was not as great 
as that found in the previous study. The studies differed in that the present study used experienced 
pilots ( the previous did not) and allowed warm-up techniques prior to retention testing. 

The studies cited above have all had in common the performance and measurement of both 
continuous, tracking and discrete, procedural skills. While differing retention intervals and task 
complexities were used, the consensus has been that procedural skills are ( 1 ) more quickly lost, 
and (2) lost to a lower relative skill level than tracking skills. 

The underlying factors responsible for this differential retention loss are still unknown, however. 
Task organization differences may be responsible for this differential (see Section 2. 1 .2.). 
Alternatively, proportionally more training may be achieved by tracking than by procedural 
performance in a short period of time, because of its continuous nature, as discussed in Section 

2.2. 1 . Also, there may naturally be more transfer and practice of tracking skills in a given 
retention interval than is allowed for procedural skills (see Sections 2.2.3. and 2.3.2 ). The fact 
that procedural skills degrade more rapidly and fully than operational skills may thus be an 
emergent property of other underlying mechanisms. Different types of parameters are also 
measured in these two classes of tasks. Procedural skills are measured by accuracy in following 
the established order of a task, while operational skills are typically measured by temporal 
parameters. If procedural tasks are measured by such parameters as the time required to 
complete a sequence of switch settings, it in effect becomes an operational task, so the true 
measure of task type may lie in the parameters measured, not in the actual task itself. 

2. 1 .2. Task Qroanization 

The actual or perceived task organization, in addition to the procedural /operational dichotomy, 
influences skill degradation. Procedural tasks may have less spatial and temporal organization 
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than tracking tasks (Gardlin and Sitterley, 1972;Trumboetal., 1965;Swinketal., 1967; Noble 
etal., 1967). Unless a study manipulates an organizational variable in a highly controlled 
manner , the task type and organizational variable will be confounded. 

Perceived organization has been most commonly manipulated by altering task predictability. 

Using a procedural task combined with a tracking task, Naylor et al. ( 1 962) systematically 
manipulated task organization by illuminating light pairs in a predictable or an unpredictable 
order. The task organization had a greater influence for lesser trained conditions, in that more 
organization was required as less training was given. Trumbo et al. ( 1 965a) created four 
differential conditions of tar^t predictability in a tracking task by selecting tracking targets so 
that every one followed in spatial sequence, or every one, second, or third target was chosen 
randomly. At retention testing, subjects who received the most predictable target sequence 
retained skills better than all other levels of predictability. In fact, performance after a five 
month retention interval using the predictable sequence was superior when compared to 
performance after only one week of retention using the less predictable sequences. When 
compared with the predictable target organization, the less predictable tasks showed 80^ to 
1 OOX more error , but 1 ittle practical difference was noted between the low organizational task 
performances. Swink et al. ( 1 967) had tracking targets either appear in a deterministic order on 
every trial, or with every fourth target randomly selected. Again, the predictable target sequence 
produced superior tracking performance for retention intervals of both 3 and 5 months, in 
another horizontal tracking task , Trumbo et al. ( 1 967) produced three levels of target 
predictability, corresponding to every 4th target, 6th target, or no targets randomly assigned in a 
sequence of 1 2 target locations. All levels of predictability produced the same performance by the 
end of training sessions, but retention at the end of one week was greatest with the most 
predictable task. The lowest predictability task produced better performance than the medium 
predictability task by the end of the retention testing session, perhaps due to the fact that 
differential training had been given to these two groups of subjects; low predictability subjects 
received 1 95 training trials, while the medium predictability subjects received only 80 trials. 
Noble and Trumbo ( 1 967) reviewed a number of experiments, breaking down retention loss by 
spatial and temporal uncertainty variables. In general , the greatest retention losses were noted in 
the most uncertain task conditions, and respond strategies by subjects varied with the amount of 
task uncertainty. 

Manipulations of procedural task uncertainty have also been used to alter organization. Naylor et 
al. ( 1 968) manipulated the predictability of a secondary procedural task , while subjects were 
simultaneously performing a primary tracking task. Subjects had to depress buttons in varying 
orders, depending on which of several lights were illuminated at a point in time. Two levels of 
procedural task organization were defined: a light sequence in numerical order, and a sequence in 
random order. After retention intervals of 1 to 4 weeks, the well organized secondary task had 
lesser performance decrements than did the less organized task. In addition to decrements on the 
secondary task , performance on the primary tracking task was also influenced. The wel 1 organized 
secondary task produced superior retention on the primary tracking task after both 1 and 4 
weeks, than did the more poorly organized secondary task. 

These have clearly demonstrated that task organization directly influences both procedural and 
tracking skill retention duration. The actual task predictability was manipulated; retention may 
also be influenced by perceived organization within a task. While no studies have compared actual 
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versus perceived predictability, this is precisely the function of training. 

2. 1.3. Task Workload 

Worklool refers to the physical and cc^nitive effort imposed by an operational task. Earlier work 
in this area studied the effects of time-sharing between several visual displays, as is required 
when driving or flying, while later work utilizaJ other , joint combinations of abstract and 
operational tasks. For many reasons, some effort has been made to develop models and measures of 
workload in generic tasks (e.g., Brown, 1978). Rouse ( 1979) has reviewed information theory, 
control theory, and queueing theory models of operator workload, as well as performance, 
physiological , and subjective workload measures. Workload is of interest due to its effect upon 
skill training and retention. Tasks requiring many simultaneous control movements, as in 
aircraft control, or tasks containing long strings of branching point (tecisions, can both be 
considered to be of high complexity and required workload. Johnson (1981) suggested that the 
number of steps required in a procedural task should be a determining factor in its probability of 
successful skill retention. 

It is interesting to ask ( 1 ) whether skills for a high workload task are retained for a different 
time period than those for a lower workload task , and ( 2) whether an individual can be trained to 
perform a given task under higher workload conditions. The organization of a task may be 
considered to directly affect workload, in that the two are inversly related ( see Section 2. 1 .2. ). 

The most valid method of increasing workload has been to add a secondary task on top of a measured 
primary task. Garvey ( 1 960) trained subjects for 25 days on a tracking task, then added a 
different secondary task on three subsequent days, inclusion of the secondary tasks greatly 
increased tracking error to levels above initial , unpracticed levels. Single- task, low workload 
training did not transfer to dual-task , higher workload tasks. Briggs and Wiener ( 1 966) noted 
that higher fidelity training is required in high workload, dual-task performance, than in lower 
worklt^. This result was generalized to flight control simulators. Rudimentary flight control , 
having low time-sharing requirements, may be trainaj on low fictelity itevices, but greater 
workload requires a higher-fidelity simulator. Trumbo et al. ( 1 967) combined a tracking task 
with a verbal number anticipation task , of varying difficulties. AcWition of the secondary task 
again dropped tracking performance to below that at the start of training. Performance after 
retraining did not increase to the level shown by those not performing secondary tasks. Further , 
performance loss after 8 days of retention was independent of the introduction of the secondary 
task. Naylor etal’s ( 1962, 1 968) subjects performed in dual-task combinations of a tracking 
task with a switch-setting procedural task, with predictability in the procedural task 
manipulated. This also influenced workload, because much more attention had to be placed on 
procedural task performance in conditions of low predictability. For each of the two tasks, the low 
predictability procedural conditions produced both poorer absolute performance and poor 
retention after a 4 week retention interval. The amount of training was the greatest predictor of 
absolute performance level. Gopher and North ( 1 977) combined a primary tracking task with a 
secondary digit-processing task, and manipulated training conditions. Greater performance 
improvements from training occurred under dual-task than under single task conditions, as if the 
motivation from a harder task was beneficial to learning. 
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The definition and measurement of mental workload is a young science, but some investigators have 
clearly implicated it as a factor in training and retention. Higher workload tasks, defined by large 
levels of cognitive time-sharing, are harder to learn and retain. Large performance 
improvements, due to training, have been noted, however. Relative skill retention duration 
between single and dual-task conditions also remains to be modeled. 

2. 1 .4. Performance Measurement 

The methodology used to assess one’s performance in a task has been a continuing issue for many 
years. In fa:t, an entire issue of a journal was recently devoted to this topic ( Human Factors, 
Volume 21(2), 1 979). Measurement of procedural and operational skill following a no- practice 
retention interval has relied on measures of accuracy or speed. These dependent variables are 
then plotted as a function of time, across several experimental subject groups. Bahrick ( 1 964) 
critiqued the basic skill retention curve, suggesting that observed changes are due to varying 
sensitivities of an overserver's perceptual/cognitive system between testing sessions, and not 
necessarily due to for^tting or retention differences. Converting degradation scores into Z-scores 
may aid in stabilizing the variance between testing sessios ( Bahrick , 1 965). 

Single- task performance measurement may not capture the concurrent demand, time-sharing 
requirements of real work environments, and many have artificially combined many tasks 
together into multiple task batteries ( e.g. , Alluisi , 1 967). These task batteries provide high 
validity, precision in measurement, and sample a broad range of abilities (Akins, 1979). 
However, some have argued that the batteries are unnecessarily artificial , and performance scores 
may be defined rather arbitrarily (Chiles, 1967; Akins, 1979). 

Swezey ( 1 979) intraJuced a Bayesian-oriented utilities model to determine what criterion level 
should be achieved at the end of training sessions for gunnery trainees. This called for a 1 0-step 
decision model , identifying components of the model and calculating utility. Other, empirically- 
based methods of assignment to training programs have also been presented (e.g. , Sav^ et al. , 
1982). 

The appropriate choice of useful performance variables and methodologies is still very much at 
issue, particularly in light of the fact that the degree of observed retention is dependent on which 
perameters are used. Seme investigators have measured absolute performance, whereas others 
used difference scores, subtracting post-training performance from retention performance. Both 
types of scores are required to evaluate loss during a no-practice retention interval ( Gardlin and 
Sitterley, 1 972). Also, variance measures, in addition to means, have rarely been used as 
performance indicators. 


7.2. TRAINING FACTORS 

This section covers those factors having their primary influence on initial task training. Many 
have suggested that these variables are at least as influential as task variables in determining the 
duration of skill retention. Factors covered here include the duration of training, the distribution 
of initial training sessions, transfer of training, and fidelity issues. The central questions to 
consider concern the required amount of training (cost) to expect adequate skill retention for a 
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desired interval ( benefit). Such Issues as whether whole or part- task training are needed, and 
the degree to which already trained skills can be transferred, are relevant in answering this 
question. 

2.2. 1 . Amount of Initial Training 

A consistent finding across many skill retention studies has been that the relative amount of initial 
training one receives is a strong praiictor of level and duration of skill retention. Overtraining on 
one task may be considered insufficient training on a similar, analogue task. Also, overtraining 
can be very expensive if it requires significant high-fidelity simulator time. Due to the 
importance of this issue, reference is made to the figures presented earlier. 

Using only a 1 5-step procedural task, Ammons et al. ( 1 958, Experiment 1 ) trained subjects for 
either 5 or 30 training trials. Initial completion time for the two subject groups at the end of 
training was 0.4 and 0.2 minutes for the 5 and 30 trial groups, respectively. After a 2 year 
no-practice retention interval, the 5 trial training group performed the task in 1.3 minutes (a 
3-fold increase) and the 30 trial group performance rose to 0.5 minute (2.5-fold increase). 
Proportionally fewer trials were required for the 5 trial subjects to regain their initial 
performance than were required by the 30 trial subjects. This training difference is plotted in 
Figure 1(B). In their tracking task (Ammonsetal, 1958, Experiment 2), subjects were trained 
in aircraft control ( using an airplane model) for a period of either 1 or 8 hours. Results from 
this study, plotted in Figure 2(A), showed that skills increased somewhat over retention intervals 
of up to 2 years. This increment was approximately equal for both training durations, but the 8 
hour group maintained about a 2 %- 1 0^ superiority over the 1 hour group performance 
throughout all retention intervals. While the superiority of longer training remained clear in 
this task , the reason for the performance increment did not. 

Mengelkoch et al. ( 1 960 , 1971) trained inexperienced subjects for either 5 or 1 0 daily 
50-minute sessions, in an aircraft flight simulator. As shown in Figure 1(C), the two groups had 
approximately the same skill ctegradation, after a 4 month retention interval , on a procedural task 
(losses for the 5 and 10 trial groups were, respectively, 20^ and 1 6^ of training levels). The 
effect of greater training was in achieving a nearly 20S5 increase in initial training level on the 
procedural task. The flight control or tracking portions of Mengelkoch’s study only showed 
significant skill degradation, from both training groups, for the airspeed error parameter. The 5 
and 1 0 trial training groups showed altitude error increases of about 1 0 feet over the 4 month 
interval (see Figure 3(A)), or about 20^-30^ increase from initially trained levels. This loss 
was significant for the 5 trial group, but not the 1 0 trial group. Like performance of the 
procedural task, the primary difference between training duration groups was in the performance 
level at the end of training, rather than the relatively long skill retention. The fact that the 5 
trial group retained their skills to the same magnitude as the longer trained group is meaningful. 

Naylor et al. ( 1 962 , 1 968) trained subjects on a dual tracking and procedural task for either 2 
or 3 weeks of daily sessions. The longer training produced relatively superior performance at the 
end of both 1 and 4 week retention intervals when compared with end of training scores, but only 
in omissive errors. The com issiveerrrors here did not significantly differ. Figure 1(F) shows 
the omissive errors from these subjects. Naylor et al. ( 1 963) used the same task and trained 
subjects for either 5 or 1 0 daily sessions (one or two weeks). Only comissive errors here 
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differed as a function of training duration. The tracking performance from these cuai-task studies 
showed similar trends. Three wreks of training produced less integrated tracking error and 
greater skill retention than two weeks of training. Integrated error was also significantly lower 
. for 2 weeks of training, compared with 1 week of training. The 1 week trained group did. 
however, display increases in skill during the retention interval, not shown by the 2 week group. 

Using a simple abstract line tracking task , Hammerton ( 1963) varied desired initial training 
criterions, as opposed to varying initial training durations. A 55? criterion group required 3 
successive daily elapsed target acquisition times that did not differ at the S% level of significance. 
Likewise, those in the 1 % criterion group had 3 successive scores not differing at the 1 % level. 
Retention of tracking skill after 6 months is shown in Figure 2(C). While two groups did not 
differ in mean time after training, the 5^ group required more than 1 0 additional seconds than 
the 1 % group after 6 months. This difference was both statistically and practically significant. 
Even the I % group exhibited significant skill degradation, in spite of their extensive training. In 
this study, the additional training to achieve the 1 % criterion was 9 to 1 7 days beyond the 8 to 22 
days required for the 55B criterion. .This ctegree of overlearning significantly decreased, but did 
not entirely alleviate, skill degradation. 

Trumboetal. ( 1965a) presented a similar line tracking task to 250 subjects. Half were trained 
for 50 trials and half for 1 00, over a 3 day interval. As shown in Figure 2(D), both training 
groups showed significant retention losses ( increased tracking error) over a 5 month interval. 

The task organization was a stronger predictor of retention loss than was the absolute training 
duration. The high training level group did exhibit less skill loss than the low training group at 
all tested retention intervals. A subsequent analysis of separate skill retention components, from 
only the 1 00 training trial group, demonstrated that the best tracking subjects retained temporal 
accuracy (as measured by lead or lag time) better than spatial accuracy (as measured by 
percentage of over or undershoot errors). Thus, temporal as opposed to spatial training may be 
more important in retaining tracking skills over a long duration. 

In a complex simulation of an Apollo mission, Youngling et al. ( 1 968) trained their subjects for 
either 60 or 1 20 days. The overall skill retention, measured by time on target, was twice as 
great for the 1 20 trial group ( 5.5 seconds) than the 60 trial group ( 2.4 seconds on tar^t). 

Hagman ( 1 983) summarized several skill retention studies performed in military contexts. 
Hagman ( 1 980) varied the number of times Army personnel repeated a procedural electrical 
alternator output test during training, increased task repetition, from 1 to 4 times, reduced 
performance time and errors by approximately constant amounts during training and after a two 
week retention. Increasing repetitions linearly increased performance until the 4 repetition 
duration training. Schendel and Hagman ( 1 980) trained Army groups to either one correct 
performance or two correct performances in the disassembly and assembly of an M60 machine 
gun. After an 8 week retention interval , the greater trained group committed fewer errors than 
the lesser trained group. Goldberg et al. ( 1 98 1 ) trained Army personnel to either I or 3 
successive correct performances of boresighting and zeroing the main gun of an M60A 1 tank. 
Again, higher performance after a 5 week retention was achieved by the more highly trained 
personnel. Schendel and Hagman also varied the time at which extra training was actually given. 
One group of subjects received extra task repetitions during the intitial training, while a second 
group received theirs half-way during the retention interval . at 4 weeks. They found no 
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significant difference between the two modes, implying that it is more cost effective to supply all 
training at one time. 

Most of the above studies have been in agreement in that skill retention is a function of training 
duration. Many questions still remain, such as whether training duration is more or less 
important than task organization or the retention interval in determining the magnitucte of skill 
retention. 

2.2.2. Training Distribution 

The way in which a given amount of training is distributed over a time interval is also predictive 
of skill retention success. Fleishman and Parker ( 1 962) manipulated the method of retraining 
following a no-practice retention interval. A massed practice group received 4 practice sessions 
within a 2 hour period, while a distributed practice group received the same level of training, 
spread across 4 subsequent days. The distributed training group outperformed the massed practice 
group by the end of retraining, but both groups performed equally well after another 1 week 
retention interval. Thus, the distributed practice may have had its effect on temporary 
performance factors. H^man and Rose ( 1 983) reported that insertion of time between 
repetitions of a task increases skill retention, but the problems associated with the disruptive 
training may overshadow their benefits in actual tasks. Hagman ( 1 980) compared massed versus 
spaced training for Army electrical alternator testers and repairers. The massed training group 
took 5 1 % longer and made 40* more errors than the spaced group. Schendel and Hagman ( 1 980) 
either gave task repetitions as part of initial training or one month later, and found no difference 
in ability after a two month retention interval. Spacing of task trials and/or sessions may be 
helpful , but there is some question as to whether its effectiveness varies with task proficiency 
level ( Hagman and Rose, 1 983). A model is needai here, and must consider the initial level of 
post-training skill proficiency, which also determines the required frequency of task repetitions. 

2.2.3. Transfer of Training 

Training transfer refers to the ability of a trained skill to generalize to a new setting. From cost 
considerations, positive skill transfer means that performance on a task can utilize already 
trained skills, saving time and money. Also, highly generalizable skills can easily be used in new 
settings or situations, for which no training has been constructed. The term validity refers to the 
degree to which training readies one for performance on a task, and is a measure of training 
transfer. 

Briggs and Wiener ( 1 966) trained subjects to perform an abstract two dimensional tracking task , 
and transferred this training to an easier task requiring the setting of a control knob. High 
fidelity training ( achieved through proprioceptive control feedback) was only required when the 
transfer task reqiuired a high level of time sharing, by forcing constant positioning. Thus, when 
proprioceptive cues and high levels of time sharing are required in a task, the training program 
should be of high fidelity. 

Reid ( 1 975) assessed training transfer from a formation flight simulator to actual formation 
flying. Untrained, formation simulator trained, and aircraft trained pitots were compared in 
actual flight formation flying. Evidence of positive simulator skill transfer was obtained, as these 
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pilots did not fly significantly differently from conventionally trained pilots. The simulator 
provided the same c^ree of training as the flight sorites, indicating a high level of skill transfer. 
Carter and Trollip ( 1 980) showed that training transfer between skills may be compared by 
plotting iso- transfer curves between pairs of skills and noting maximum transfer pairings. An 
operations research technique, the Lagrange Multiplier, was useful for determining costs and 
benefits of training. 

Validity of training can also be evaluated by the method proposed by Goldstein ( 1 978), who used a 
four level approach to evaluation: ( 1 ) Training validity , determined by trainee performance 
relative to standard training criteria, (2) Performance validity , measured by transfer of job 
performance, using criteria from the actual job, ( 3) Intra-oroanizational validity , measured by 
the performance of a group of new trainees based on the performmance of a previous group , and 
( "4) Inter-oroanizational validity , meesured by the degree to which a training program validated 
in one organization can be usai in another organization. All of these levels must be evaluated to 
determine the effectiveness or validity of a given training program. Moving from the first level to 
the fourth, an increasing number of variables influence the success of training. Also, the 
necessary level of complexity in a training needs analysis must depend on the final goal of training. 

If one’s goals do not reach beyond the second level , for instance, there is no need to consider levels 
3 or 4. Such a structural assessment of validity is required to transform training needs 
assessment from art to engineering. The validity of training apparatus, according to Crawford and 
Crawford ( 1 978), lies more in the manner in which it is used, rather than in the degree of its 
similarity to actual equipment. These investigators substituted conventional hands-on practice 
for part-task computer-based training on the use of an integrated control panel in an 
anti-submarine airplane. Control subjects performed on a high fidelity simulation of the control 
panel , while experimental subjects were trained using a graphic simulation on a touch screen 
display. The experimental subjects completed more tests, in less time than the control subjects. 

The computer-based training was found to provide at least as good ski 11 acquisition, in less time 
and at lesser cost, than the full simulator training. A cost analysis indicated a substantial 
two- thirds cost savings over the conventional training method, much of which was due to a smaller . 
number of instructor man-hours. 

Adams ( 1 979) contrasted the shortcomings of two methods of rating flight simulators for aircrew 
training. A transfer of training study measures the relationship between achieved task competence 
and proficiency on the flight simulator, while the rating method requires an engineering and 
experienced pilot assessment of hardware and flight similarity between the simulator and actual 
aircraft. Adams reviewed many studies with the thesis that both techniques are flawed. This does 
not mean that simulators are not useful , though. Humans require the perceptual- learning, 
stimulus- response learning, and feedback provided by simulator sessions. In addition, simulators 
successfully motivate trainees better than lower fidelity learning environments. Because 
simulators are based on these well-founded principles, simulators need not be evaluated for their 
effectiveness; this may be taken on faith (Adams. 1 979). 

As part of his procedural control setting training study , Johnson (1981) measured skill transfer 
by manipulating the sequential steps of the original task. In two experiments, low-fidelity 
paper -and- pencil training transferred very well to the new operational task. Although the two 
tasks likely utilized similar skills, this was further evidence of the utility of analogous tasks for 
training purposes. Validity ctetermination and training needs assessment are still very much 
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debated topics; this section has only briefly introduced these issues. 


2.2.4. Training Fidelity 

A large and detailed lieterature has developed, concerning required fidelity in task training. At 
issue here is whether full fidelity is required to achieve adequate and retained skill performance. 
Considerations such as whether open or closed- Icop simulation control is required, whether 
simulators must necessarily move, or whether aifequate whole task performance can be achieved 
from part-task training have been aidressed. In this paper, fidelity refers to the degree to which 
a training device can mimic an actual task of interest, such as flying an aircraft. 

Naylor et al. (1963) manipulated the type of rehearsal subjects received in a dual tracking and 
procedural task. The procedural task consisted of 9 pairs of lights to which responses had to be 
made, and the tracking task was a three dimensional meter nulling task, in which roll, pitch, and 
yaw were simulated. In whole- task training, subjects practiced with both tasks simultanoDusly , 
as required for the measured performance task. Part-task training required separate practice on 
each task. Retention differences between the training conditions were significant on tracking 
performance, with the part- task rehearsal group displaying inferior performance. Whole- task 
rehearsal was also superior for procedural task performance, but this superiority lessenaj over 
the retention testing. Whole- task rehearsal was superior with a small amount of training ( up to 
5 days), but after 1 0 days of training, the two types of rehearsal were not signifianctly different. 
Naylor and Briggs ( 1 963) manipulated rehearsal conditions on this procedural switch setting 
task. Whole- task rehearsal consisted of repeating the original task half-way through the 2 month 
retention interval. Part-task rehearsal conditions consisted of either ( 1 ) spatial rehearsal , with 
stimulus events occurring at equal temporal intervals, or ( 2) temporal rehearsal , with stimulus 
events occurring at varying times as in the original task, and stimuli appearing in a regular 
spatial order. The whole- task group produced far fewer omissive procedural errors than the 
part-task groups upon initial retention testing. The whole-task and part, temporal-task 
rehearsal were superior to spatial -task rehearsal when considering comissive errors. Thus, 
whole- task rehearsal here was best, closely followed by part-task temporal rehearsal in 
upholding skill retention over a 1 month retention interval. 

Fleishman ( 1 965) presented a multidimensional tracking task to inexperienced Air Force 
trainees, with the objective of predicting whole- task performance from various combinations of 
part-task training. The performance measurement device contained display dials for heading, 
altitude, bank, and airspeed, which all had to be simultaneously centered. Subjects were first 
proficiency trained on one dial , then two dials, then the entire task. Correlations between 
one-dial , part-task performance and whole- task performance ranged from .46 to .54 across the 
subjects. Between two-dial , part-task performance and whole- task performance, the range was 
.63 to .70. Multiple component practice was a better predictor of whole- task performance than 
single task performance in this multidimensional task. In addition, the multiple component 
performance was at least as predictive as linear combinations of the single task performances. 

The greatest correlations ( .74) between part-task and whole task performance were found with 
linear combinations of two, two-dial practice. In this work , the actual task components that were 
used was less important than the fact that simultaneous practice had occurred. All predictive tasks 
here were part-task practice, but this investigation suggested that a continuum exists in training 
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effectiveness, between various integrative or combinatory levels of sub-task performances. 

In his space vehicle approach and landing simulator. Sitter ley ( 1 974) varied the fidelity of pilot 
retraining methods, following a 4 month no-practice interval. The number of visual cues present 
in the training session strongly predicted the level of performance achieved, and the level of visual 
cueing was independent of the fidelity of the simulator session. Static photographic training was 
superior to open-loop , dynamic training. All cues, however, present in the static pictorial method 
were present in the dynamic display training, so the total number of cues was not soley predictive 
of training effectiveness. As stated by Sitter ley, the most important element in these training 
alternatives was the presentation of efficient cues which assist®) pilots in recalling their basic 
flight experiences. Thus, open- loop, static training methods may actually be superior to more 
costly methods, given careful training program design. Trollip ( 1 979) compared a computer- 
based with a simulator- based training program for aircraft flight control. Control subjects were 
trained in a flight simulator, while experimental subjects were trained on a plasma touch screen 
terminal with an attached hand controller. The computer-based training produced significantly 
fewer critical errors and better flight control than the simulator. This trend was identical in both 
no wind and crosswind flight patterns. When generalizaing flight control to a new prcx:edure. no 
difference was found between the two methods of training. The computer-trained subjects 
performed better, learned quicker, and made fewer mistakes than their control counterparts, it 
allowed students to develop better mental images of the ideal flight characteristics. Johnson 
(1981) also found that training requiring large use of mental imagery cues can produce the 
highest level of skill retention. Even in high complexity flight simulation and control 
environments, the highest level of fidelity is not required. Sitterley and Berge ( 1 972) and 
Sitter ley ( 1 974) concluded that static rehearsal or training may be superior to the d/namic, 
higher fidelity rehearsal because of the artificially increased importance of visual cues. 

One variable significantly influencing fidelity in aircraft simulations is flight motion. This has 
been a controversial topic over the past decade, with many insisting that motion cues are 
unnecessary for general aviator training. Caro ( 1 979) discussed this issue with reference to two 
different motion cues, maneuver motion and disturbance motion. The former motion cue refers to 
those motion changes initiated by the pilot, whereas the latter refers to those cues initiataj outside 
the immediate control loop, such as turbulance or engine effects. While maneuver motion moves 
the aircraft platform , it does not cause important alerting cues provided by the disturbance cues, 
which lead to quicker and more accurate pilot control of the simulator. No motion, on the other 
hand, is required if the simulated aircraft is easy to control and relatively stable (Caro, 1 979). 
Thus, required fidelity here was based on a logical analysis of task training requirements. 

In the monitoring and control of a procedural industrial operation, Johnson (1981) utilitzed 
three different training strategies: ( I ) conventional , full-fidelity practice on the actual task , 

(2) medium-fidelity reproduction stud/ of photographs, where the subject was allowed to draw on 
the photos, indicating his procedural responses, and (3) low-fidelity, blind practice, where the 
subject was allowed to stud/, but not write on photos of the control equipment. Although the 
conventional strategy provided the quickest learning time (the blind practice required 1 .5 times 
as long to reach criterion), the conventional and reproduction training did not produce different 
control setting errors after a 3 month retention interval. This illustrated that the highest fidelity 
training is not required in procedural tasks. Johnson and R o us e ( 1 982) also found that low and 
medium fidelity training in aircraft power plant troubleshooting is very competitive to high 
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fidelity simulation. The highest-fidelity method in this study included training on the actual task , 
medium fidelity required a special power plant simulation, while the lowest fidelity condition 
utilized videotaped lectures and live quizzes. Victeo training produced the greatest performance 
with all simulataj failures, and the original task and simulation were similar in their 
effectiveness. From a cost consicteration, the low and moderate fictelity ctevices provided sufficient 
problem solving experience to effectively compete with the conventional training methods. 

According to guidelines posed by Cream et al. ( 1 978), the specification of required training 
fittelity appears to be art. rather than engineering. They stated that ( 1 ) essential and nonessential 
aspects of controls and displays must be differentiated, in that many of these elements are not 
required for proper training. ( 2) The choice of fidelity is more complicated when dealing with 
displays rather than simple Indicators. Also, no rigorous decision-making procedures have been 
developed in the area of cost/benefit fidelity analyses. Though experiential -based fidelity 
definintion has been used for many years, no useful guidelines exist for the development of 
training for new tasks. 

Perhaps the degree of required fidelity is a function of how little is understood of the processes 
required to carry out a given task. Seemingly complex tasks may only be combinations oi a limited 
number of combined operations. On these, perhaps part- task training would be sufficient, if the 
actual components could be identified. Full-fidelity, whole- task training would then only be 
required in very complex tasks. 

2.2.5. Adaptive Training 

Both ground-based and on-orbit training systems should be adaptive to trainee performance, for 
maximum efficiency. This research area has recently shown substantial growth, as a result of the 
development of specific adaptive systems. Machine-controlled adaptive training simply automates 
a skilled instructor , by modifying the training stimuli as a function of trainee performance. 
Training efficiency is maximized, because effective learning only takes place when training is at 
an appropriate level of difficulty ( Kelly, 1 969). Adaptive learning curves typically show a 
linear relationship between ability and time, as opposed to conventional training curves. 

The marker variable for training adaptation may vary, depending on the nature of a task. Johnson 
and Haygood ( 1 984) utilized performance on a ssxindary light recc^nition task to adapt the 
difficulty of a primary tracking task. Williges and Williges ( 1 978) concluded that the most 
effective adaptive parameter should be a multivariate combination of several performance skills. 
Matheny ( 1 969) argued that the time lag between a system response and an operator’s subsequent 
performance should serve as the adaptive parameter in general man-machine systems. While 
many parameters have been used, they have all served the function of varying the difficulty of a 
primary task. 


7 RFTFMTION INTERVAL FACTORS 

This section discusses those skill degradation issues directly related to the retention interval , 
between the initial training and actual performance. Two factors here have important 
implications for skill retention. ( 1 ) The duration of the retention interval has been extensively 
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studied, using intervals from a few minutes to well over two years. (2) The nature of activities 
performed during the retention interval influences skill degradation, just as practice of any task 
should do. Most investigators in this area have concluded that learned skills regularly degrade 
with increasing no-practice retention intervals. Bahrick (1964) has, however, disputed this 
concept, claiming that retention curves based on anticipation, recognition, or free recall reflect 
changes in one's sensitivity from session to session. This complaint, however, was only drawn 
against measures with only right/wrong responses. Many procedural studies, using other 
continuous measures such as completion time, have indeed found evidence for regular skill 
degradation over time. 

2.3. 1 . Length of Interval 

Retention intervals are an important consideration when designing refresher training on lengthy 
crew missions. The many skill retention/ ctegradation studies have all shown large performance 
decrements upon post- retention testing, when no retention practice is allowed. Estimates of the 
percentage skill loss at various time intervals allow an empirically determined estimate for the 
frequency of refresher training, given that retraining is required when performance falls below a 
set criterion. Longer intervals are generally accompanied by greater loss in skills, but this is 
very task specific. Ideally, this review should provide an overall skill retention function, 
mapping percental of skills retained versus retention duration. However, reality dictates that 
between-study variations make such generalizations and models very hard to achieve. The most 
important question that can be answered here is whether a constant degradation across many tasks 
is found, with all other factors being equal. A cursory analysis of the results in Figures 1 through 
3 indicates that skills degrade with time when not subjected to interim practice, and that the level 
of degradation reaches as asymptote in some studies. 

Procedural Tasks . With few exceptions, procedural performance is marked by consistently 
increasing decrements with prcqressively longer retention intervals. Neumann and Ammons 
( 1 957) found that a one year, 90^ loss in post-training performance was about the same as 
initial performance at the start of training, but proficiency was quickly regained upon retraining 
(Figure 1 A). On a task of nearly equal complexity, Ammons et al. ( 1 958, Expt. 1 ) found a 2 to 
3- fold increase in task completion time after a one year interval , which did not appreciably 
increase after 2 years of retention (Figure IB). The magnitude of relative skill d^raiation was 
the same here, regardless of the original number of training trials. Mengelkxh ( 1 960 ; 1 97 1 ) 
also found that relative magnitude of skill loss was independent of the amount of training ( F igure 
1C), where subjects showed a 20^ decrease in correct procedures after a 4 month retention. In 
an extremely complicated 1 69 hour mission simulation, Cotterman and Wood ( 1 967) found 
relatively small degradation over a 3 month retention when only a single parameter was 
considered (Figure 1 D). The probability of successful performance over the interval fell by about 
.03. However , when all parameters in all phases of the simulation were considered, the 
probabilities dropped significantly over the interval; initially at an average of about 0.6, it fell to 
about 0.4 after the retention interval , suggesting that a failure was highly likely in some mission 
phases. This study was flawed, however , due to uncontrolled retention interval activities and 
small sample sizes (Gardlin and Sitter ley, 1 972). The performance of complex control and 
emergency procedures clearly degrade in required procedural time after 6 months, and Sitterley 
et al. ( 1972) notala 4.5 fold increase after 4 months of retention (Figure IE). Johnson ( 1981 ) 
measured the time required to set controls in an 87-step procedural task, and found a mean time 
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of 8 minutes after training had increased by 50^ to 1 2.8 monutes after about 2.5 months. As 
gross estimates of the magnitude of skill degradation in procedural tasks, 20^ to 50^ degradation 
in 3 to 6 months, and 50^ to 1 0OX or more in more than 6 months may be made, based on the 
above data. 

Investigations over shorter retention intervals, up to 1 month, have not found consistent skill 
degradation patterns. For example, using their switch setting task, Naylor and Briggs ( 1 963) 
found a 20^ decrease in omissive errors, but a 233^ increase in comissive errors after 1 month 
(Figure IF). Likewise, using the same task paired with a tracking task, Naylor etal. (1968) 
found the only retention degradation in comissive errors from the subject group with lesser 
training and low task organization (Figure IF). Thus, skill retention of less than one month is 
harder to predict than longer durations, and may be dependent on many other task factors. 

Simple Tracking Tasks . Performance on tracking tasks have not as a rule shown the predicatble 
and regular retention decrements shown by prxedural task performance. In controlling the flight 
characteristics of a mcxlel airplane, Ammons et al. ( 1 958, Expt. 2) found only a small 5® 
decrease in tracking time on target, between 1 and 24 months ( F igure 2A). Theseslight skill 
decrements followed slight but significant increments between the end of training and 1 month 
retention. Beyond the absolute performance difference at training, the duration of training did not 
alter the relative decay rate of skills. Hammerton ( 1 963) used a measure of target acquisition 
time in a tracking task (Figure 2C). By varying the allowable amount of session- to-session 
variability at the end of training, differential skill degradation was observed at a retention 
interval of six months. The looser criterion group showed a 3-fold increase in tar^t acquisition 
time, whereas the tighter criterion showed only about a 1 -fold increase. Thus, in tra:king, 
regularity of performance as well as absolute m^nitude appears to predict the d^ree of skill 
degradation. Over a short, 1 month retention, Naylor et al. ( 1 962; 1 968) demonstrated 
statistically significant skill loss at 2 levels of training duration and two levels of task 
organization (Figure 2E). Relative losses averted about 1 656 at one week, and 44% at one month. 

Four studies cited here used integrated tracking error as a dependent measure. Fleishman and 
Parker ( 1 962) had two groups of subjects perform tracking tasks. A group receiving no formal 
training showed no performance decrement at up to 1 4 months of retention (Figure 2B ). A second 
group who received formal training on the task showed a 1 -fold increase in error after one year , 
but then showed a 4-fold increase after 2 years. Trumbo et al's ( 1 965a) subjects showed 
virtually no performance decrement with intervals up to 5 months, when the task was 
unpredictable, with random targets located on every trial (Figure 2D). However, when the target 
position was more predictable, post-training tracking error was about 50% less than in the 
predictable condition. Retention intervals of 1 and 5 months produced large decrements in 
performance, upwards of 50%-60% from training levels. Equal degradation rates were found for 
both 50 and 1 00 training trial conditions, with the latter condition always producing better 
performance. Trumbo et al. ( 1 965b) also demonstrated a 24% skill loss after a 1 month 
retention interval (Figure 2D). Swink et al. ( 1 967) also manipulated task predictability and 
training duration in a tracking task (Figure 2F). The retention interval in this study, however, 
was unrelated to tracking error , as tracking ability did not degrade over a 5 month no-practice 
interval. Roehrig ( 1 964) had several subjects stand on a small balancing platform , and measured 
the time duration that they could balance to within ± 1 .5* of horizontal. After a 50 week hiatus 
from the task, all subjects demonstrataJ performance at least as great as shown at the end of 
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training. Much like the well-known fact that “one never forgets how to ride a bicycle," this task . 
once trained, seemed to trigger the same skill retention. Perhaps there is much ability transfer 
from balancing in ordinary walking (once the body has been trained to use the muscle groups 
required by the task) , and subjects were unknowingly practicing the task. This conversly 
suggests that we can forget how to ride a bicycle, if balancing Is not normally practiced, as in 
bed-ridden or spacefaring individuals. Research needs to first be conducted to determine which 
tasks are dependent on balancing practice in a gravity environment, as was implied by an earlier 
report ( National Academy of Sciences, 1 972, p. 245). A taxonomy of tasks, organized by gravity 
(tependency , should be developed. 

Simple tracking performance, not requiring a iar^ number of simultaneous decisions and 
elements of concious cognitive control , does not appear to degrade as regularly and predictably as 
procedural skill performance. While some studies did find large decrements after a few months 
(e.g., Fleishman and Parker, 1 962; Hammerton, 1 963), others have found no evidence of skill 
degradation (e.g., Ammons et al., 1958; Swink etal., 1967). Clearly, in simple tracking, other 
factors are important in determining the degree of retention loss. From those studies cited here, 
those factors must include duration of initial training and task organization or predictability. 

Complex Tracking Tasks. In those few studies using tracking tasks in higher complexity fl ight 
control contexts, performance on at least one parameter has shown strong interval- related 
degradation. Mengelkoch (1971) found significant increases in altitude error after a 4 month 
retention interval (Figure 3A). The skill degradation rate was equal between the 5 and 10 
training trial groups, but the 1 0 trial group consistently made about 20^ less error than the 5 
trial group. Sitter ley and colleagues also used altitude error, among many other parameters, in 
their space vehicle simulations. Sitter ley and Berge ( 1 972) measured a 2-fold increase in error 
over a 6 month interval , whereas Sitterley ( 1 974) found a 5-fold increase over a 4 month 
retention interval (Figure 3B). In an alternate parameter, Sitterley and Berge measured a 55^ 
increase in integrated pitch error after a no-practice retention of 6 months duration (Figure 3D). 
When measuring ability to null complex movements in the display within a simulator. Youngling 
et al. ( 1 968) found a nearly linear relationship between the len^h of retention and performance 
loss ( Figure 3C). Here, total tracking time on target decreased from approximately 40 seconds at 
training to about 33 sa»nds after 6 months, or a 20^ loss. Percent flight skill degradation , a 
composite of many flight parameters, is perhaps the best overall measure of flight performance. 
Sitterley et al. ( 1 972) noted a 400^ daorease while Sitterley ( 1 974) noted a 200^ decrease in 
skills over a 4 month interval (Figure 3E). Clearly, flight skills are very sensitive to 
no-practice retention intervals, and may degrade by 4 or 5-fold over a few months. 

2.3.2. Interpolated Activities 

Practicing critical skills during the retention interval does aid retention performance. The 
relevant issues here are ( 1 ) for which types of tasks does practice aid, ( 2) what are the practice 
task transfer characteristics to the job performance task, and ( 3) are these dangers of negative 
task transfer; i.e. , practice that can accelerate performance degradation. 

Brown et al. ( 1 963) required subjects to perform Naylor’s switch setting task as well as a three 
dimensioal tracking task. Rehearsal on these tasks was manipulated on 4 ^s of a 1 5 day 
retention interval. For the tracking task , rehearsal greatly aided retention performance, but the 
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fidelity of the rehearsal did not alter this result. Performance decrements were effectively erased 
with task rehearsal. On the procedural task, rehearsal had influence on both commissive and 
omissive errors. For both types of tasks, sufficiently long original training attenuated the 
positive effects of rehearsal. When training was more limitied in scope, practice during the 
retention interval lead to large increases in skill retention. Naylor and Briggs ( 1 963) tested 
variations in type of rehearsal, on retention performance of their switch setting task. The four 
rehearsal sessions oxurred mid-way in their 25-day retention interval. One group received 
actual task rehearsal , one received no rehearsal training, and two groups received either 
part-task temporal or part-task spatial rehearsal. On omissive errors, the actual task group 
committed about half the errors of the other three rehearsal groups. On commissive errors, the 
actual task and temporal part-task groups were superior. Whole- task rehearsal was superior to 
part-task rehearsal conditions. Spatial rehearsal was barely any better than no rehearsal at all , 
but the time dimension may have been more difficult than the spatial dimension in this task. 
Trumbo et al. ( 1 965b) compared verbal rehearsal with no rehearsal in a tracking task , over a 
one month retention interval. Part-task rehearsal required subjects to verbally repeat the 
tracking target laation, referencing to its presented numerical location. On this task, no tracking 
mean performance retention difference was due to rehearsal , but a greater variability in tracking 
in the rehearsal group than non- rehearsal group was found. 

Sitter ley and colleagues investigated the type and distribution of rehearsal for their complex 
spacecraft simulation tasks. Sitter ley and Berge ( 1 972) presented both emergency prtxedural 
and flight control tasks to thier inexperienced subjects. After four months of inactivity, both task 
performances were greatly degraded, beyond the minimal proficiency level. As part of their 
experimental design, two subject groups recieved static rehearsal training during the retention 
interval period, where a session consisted of a review of the flight training manual , photographs of 
the cxkpit environment, and a written evaluation test. The static rehearsal greatly counterKl 
performance degradation for the procedural task, at both 3 and 6 month intervals. The interim 
rehearsal aided performance as much as allowing dynamic warmup immediately prior to the start 
of retention testing. The continuous task , on the other hand, responcted differently to rehearsal and 
warmup training. At a 6 month retention interval, static rehearsal was insufficient to maintain 
performance in all control skills; d/namic warmup was required to insure reliability. The 
regular rehearsal sessions were, however , adaiuate for skill maintenance over the 3 month 
retention interval. Thus, long retention intervals roiuire both rehearsal and warmup for flight 
control , but only require static rehearsal for procedural tasks. Using an even more complex space 
vehicle approach and landing under both visual and instrument flight conditions, Sitterley et al. 

( 1 972) added a condition of dynamic rehearsal training, in addition to improving the static 
rehearsal training method. The improved static method utilized photographs of flight instruments 
and scenes at critical times, and allowed the subject to sit in the simulator cockpit for 
refamiliarization prior to testing. The dynamic rehearsal condition included the above static 
rehearsal , then the pilots were allowed to view three dynamic flights from the cockpit, in an 
open-loop fashion. The pilot still did not have direct interaction as he would have during warmup 
practice. Results showed that, like Sitterley and Berge ( 1 972), static rehearsal improved skill 
retention, but required dynamic warmup practice for adequate proficiency. The dynamic 
rehearsal prevented skill degradation for alt procedural and flight control tasks, with the visual 
flight control portions receiving the greatest training benefit. The static method was only slightly 
worse than the dynamic method in retention of flight control or continuous skills. 
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From a cost/benefit viewpoint , Sitter ley (1974) claimed that the static rehearsal methcc nad the 
greatest development potential. To test an advanced static rehearsal version, Sitter ley presented 
more pictorial information along both normal and sub-nominal flight paths, and enhanced pilot 
involvement to reinforce critical perceptual cues in the visual environment. The static rehearsal 
was presented in a booklet format for self-study by the pilot. All retention testing was preceded 
by a 40 minute slide show of real time cockpit views of the approach and landing. After a 4 month 
retention interval , the advanced static retraining countered all skill degredation , more so than 
even the dynamic rehearsal of the previous study. Sitterley suggested that the carefully 
structured visual cues at critical moments were sufficient to key appropriate pilot responses. 

The above studies have nearly all confirmed the utility of retention interval practice in countering 
both continuous and procedural skill degradation. When training is of insufficient duration, 
rehearsal methods can be substituted to some ctegree. The rehearsal training should be carefully 
designed to provide minimal cues required to successfully perform the task of interest. 
Experiments by Sitterley have demonstrated that rehearsal for complex flight control does net 
have to be closed loop and high fidelity. So long as the important visual cues have been provided, 
open- loop, pictorial reviews may ad^uately be substituted for the real task. Of those studies 
reported here, none have concluded that rehearsal degrattes retention when compared with no 
rehearsal. However , none have systematically varied rehearsal tasks so as to provide negative 
skill transfer. 


3. SPACE MISSION TRAINING 

It is frustrating to study empirical research on task training factors, then consider the techniques 
that are actually used to define training requirements. Cream et al. ( 1 978) outlined "systematic 
methods" usually used to speoify training objectives in a specific task. First, the behavioral skills 
and knowlec^ required of graduated trainees are identified. Next, these are matched against the 
actual ability of new students. The identified differences then define training requirements of a 
program. As recognized by Cream et al. , a lack of task analysis data for defining training 
requirements exists, especially with new systems. They recommended seeking out analogous 
tasks, again avoiding the issue of task analysis. Such experiential- based development can be a 
costly error in new systems development, where many competing task factors can eclipse 
unforseen interactions. This report defends the need for a quantitative model of training 
requirements as a function of task factors. Of course, much research will be required to specify 
this model. 

This section presents a brief outline of operational space mission training at NASA, for the 
purpose of establishing a foundation from which to define Mars mission training requirments. 

3.1. CURRENT MISSION TRAINING 

Training programs for Space Shuttle missions proceed from part-systems teaching and practice to 
more complex, fully Integrated simulations. A typical astronaut training program currently 
requires about 5 years from start to flight. Training starts with stand-alone equipment, then 
proceeds to joint, integrated mission simulation. Initially, workbooks and self-paced computer 
aided trainers are used to gain knowledge and proficiency on specific systems. Single and multiple 
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part-task trainers are then used to gain required psychomotor proficiency. Examples of these 
trainers are specific shuttle control panels and the RMS. An underwater weightless training 
facility, and airborne parabolic flights may be used for specific procedure training. Shuttle 
systems simultators are now used for many tasks. The multiple- task shuttle simulator may be 
tied with one or more flight centers in partial-mission simulations. The full mission may then be 
simulated by tying in all payload customers and control centers. For very complicated maneuvers, 
this joint simulation may even be repeated on-orbit just prior to the actual performance. 

Very little active astronaut performance measurement is currently conducted, once selected 
(Akins, 1 979; Niccgossian, 1 984; Christensen and Talbot, 1 985). Throughout the literature, a 
prerequisite for the evaluation and developpment of training procedures is unbiased performance 
data (see Goldstein, 1978; Cream etal., 1978;Swezey, 1979). As an illustration of this, 
consider a part-mission simulator session. Trainers prepare scripts of system failures that 
occur at regular intervals, every few minutes. The task of the trainees is to make educated 
di^noses and decisions, while controlling the spa:e vehicle. After completing each simulation 
run, the training scripts are reviewed with the trainees, pointing out mistakes that were made. A 
new run then begins, with the hope that lessons have been learned. While trainees clearly learn 
from this training scenario, limitations in vehicle design or human capabilities are not collected. 

A series of failures made by all trainees would not be noted; such failures are valuable data to use 
in the redesign of systems. A separate performance monitoring system . invisible to trainers and 
trainees, would be useful here. 

Soviet cosmonauts also utilize simulators and part-task trainers, but their training philosophy 
differs in a basic way from U.S. philosophy. Rather than rely on basic documentation in training 
programs, they listen to a lecture from a specialist several times, taking notes ( Lenorovitz, 

1 982). The Soviet program also places more emphasis on psychological status than the U.S. 
program , with tests given at training to assure psychological compatibility with crew members, 
and regular psychological monitoring during and after flight (Borrowman, 1982;Bluth. 1962; 
Oberg and Oberg, 1 986). Future U.S. missions must concentrate more on psychological status of 
crew members during training (Conners etal., 1985; Collins, 1985). 


5.2. SPACE STATION TRAINING 

A recently published document ( NASA, 1 986) detailed training requirements for the near- 
operatioal space station, to be launched in the early I990’s. This section will summarize 
important aspects of this paper. 

3.2. 1 . On-Orbit Versus Ground-Based T raining 

Specific criteria have been imposed to assign training to on-orbit or ground-based trainers. 
On-orbit training is preferred for complex psychomotor skills, or time-critical procedures, 
safety drills, and maintenance of group behavioral dynamics. This training is preferred for 
microgravity, low-cost, and low probability of occurrance tasks. On-orbit refresher training 
will also be carried out prior to unscheduled maintenance tasks. Ground-based training will be 
preferred for fundamental , safety-critical tasks, such as space station activation or medical 
procedures. Basic training in group dynamics and habitability will be carried out on the ground. 
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Ground-based training will , in general , be preferred for prerequisite skill acquisition and 
day-to-day space station operations. On-orbit training is a supplament to basic ground- trained 
skills, so few skills will only be trained on-orbit. 

Initially, most training will be performed on the ground, but more and more training will be 
performed on-orbit as the space station program matures. Ground-based training will include 

( 1 ) ingress and egress to and from space station, (2) activation and deactivation of space station, 

( 3) systems training, with emphasis on understanding, ( 4) spacecraft docking and tethering, ( 5) 
RMS and robotics. ( 6) orbital management and communications, ( 7) habitability systems. (8) 
safety, emergency, medical and maintenance procedures, (9) integrated simulations, stressing 
team approach to problems. On-orbit training will include ( 1 ) spacecraft dxking and tethering, 

( 2) refresher training on RMS, ( 3) crew rescue EVA, (4) handling of fuels and other hazardous 
materials, ( 5) use of avionics equipment, ( 6) emergency and malfunction procedures, ( 7) 
manned systems refresher training. Ground-based training will initially be performed for all 
phases and activities. Eventually, primary training for some skills or activities will be shifted to 
on-orbit. This training will be available both to NASA astronauts and to contractors or customers 
of NASA. 

3.2.2. Training Breadth 

Training programs for the space station will be developed for the crew, ground-based flight 
controllers, and training instructors. The crew members will require more training at greater 
frequency for procedural skills than for psychomotor skills. Launch schedules will impose 
training duration limits. There are still questions as to the relative amount of self-paced 
training, amount of on-board training, and relative training differences between crew members. 
Flight controllers will initially take part in full integrated mission simulations, however later in 
the space station program , fewer formal simulations will be conducted. Eventually, no joint 
controller-crew simulation will take place, due to their inherent complexity and time 
consumption. I nstructors must also be trained in procedures. Questions exist as to the number of 
required instructors per crew member, and the complexity of their simulation scripts. 

The general direction for breadth of training is one of initial full scale, integrated simulations 
involving all parties tapering to later separate simulations of mission components. This change 
will be required to shorten the training time of space station crews, and to decrease the cost of 
rotating crews. Some amount of procedural or psychomotor skill practice, such as one- half hour 
per day. will be mandatory on-orbit. 

3.2.3. Training Technologies and Facilities 

Space station training will make extensive use of computer aided and adaptive instruction. 
Computer aided instruction systems permit a consistent presentation of material in a given 
sequential order . Intelligent or adaptive computer aided instruction allows material to be 
resequenced or altered according to the needs of a trainee (Morgan and Erb, 1 986). Both of these 
types of systems will be utilized in ground-based and on-orbit training. Intelligent systems will 
be designed to serve as a coach , rather than as a tutor or manager, in that advice is provided to the 
trainee trying to meet an educational objective. 
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These instructional programs will be implemented on interactive laser videodisk storage systems. 
The trainee will respond via keyboard and voice. The system will output via television monitors, 
wide angle visuals, helmet mounted video, and voice synthesis. Appropriate videodisks for every 
required repair or maintenance will be onboard; the capacity for transmitting the equivalent 
content of a videodisk directly into the training hardware may also be present. These systems 
offer many advantages over conventional computer-based trainers. They are potentially very 
small and portable, still or moving scenes are of higher fidelity than computer graphics, and it is 
cheaper to film s«|uences of movements for videodisk interpretation than developing reliable 
graphics via computer. Astronauts on EVA will have the capability of viewing procedures in a 
helmet mounted display as they are performed. These videodisk trainers can also contain other 
controller att^hments to allow realistic practice with complex psychomotor skills. 

Space station training will also be embedded within operational controls and displays. By 
monitoring performance while a trainee attempts to complete a given task, better involvement and 
motivation are achieved. The monitor will act as a coach, much like a master-apprentice scenario. 

Ground-based NASA training facilites for the space station missions will include ( 1 ) manned test 
facility, for engineering design and testing, (2) mxkup and integration laboratory, for RMS 
training, ( 3) weightless environment training facility, for crew training, ( 4) systems 
engineeringsimulator , for flight training, (5) space station training facility, for high-fidelity 
crew training, ( 6) shuttle rendezvous simulator, and (7) integrated EVA simulator. As in the 
shuttle training program, training will start with part-task, single-system trainers, and end 
with full-mission, multiple system simulation. Much of the shuttle training facilities will be 
utilized for space station training. 


3.4. MARS MISSION TRAINING 

Until an empirically determined model of training requirements has been developed, all conceptual 
designs are merely "straw-man” estimates. However , based on previous sections of this paper , 
some recommendations may be made. 

On-Board Versus Ground-Based Training. Whether training should be given on-board or on the 
ground should not be an issue on a Mars mission. The crew members must have the necessary 
resources to rehearse procedural lists and psychomotor skills whenever required. Ground-based 
training should consist of academic systems overviews and those skills required for complete 
systems understanding. It is imperitive that complete understanding be achieved prior to flight, 
as effective refresher training can only be assurred with well organized tasks. Ground-based 
training might include other knowledge acquisition, beyond the immediate scope of the mission, to 
guard against unexpected events. As an example, this training might include psychological or 
social models of small groups. In general, ground-based training should be academic and broad, 
while on-board required training shoud be specific and skill oriented. Of course, pilots will 
require all training prior to the mission. 

Scope of Training. Ground-based and on-board training and refresher programs must be designed 
to counteract the negative aspects of the space environment, as discussed in Section 1 , in addition 
to maintaining skill and knowledge retention. Skills must be regularly refreshed, according to a 
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yet-to-be developed model of retention time. Using such a model , a computer program could list, 
on a daily basis, those skills or procedures that need refresher training. Ideally, a crew member's 
required refresher training should be determined automatically and adaptively. Periodic 
performance measurement on a testing battery could indicate level of retention and pinpoint areas 
for needed training. Training for crew autonomy and confinement will be harder to define, until 
more is known. Drills may be required to measure the cohesiveness of the crew. Lists of critical 
procedures must be regularly reviewed and trained, as should the dally workload level. As 
measured by a model , mental and physical workload must be constantly reviewed and reallocated 
among the crew members. 

All crew members should be encouraged to develop expertise, while in- transit to Mars, in 
academic fields other than their own. The on-board teaching expertise clearly will exist. 
Establishing a formal instructional regimen will aid in maintaining cognitive abilites of both 
teachers and students. Healthy interaction between the crew members will also be maintained. 

The outcome of such concentrated training could even consist of additional academic degrees. 

Periodic Drills. Emergency and disaster drills should be conducted, as called for by either ground 
control or by the on-board commander. Many controls could be placed in an alternate, embedded 
training mode for conducting these drills. Images of impending meteorites, etc. , could even be 
projected onto displays or windows. Crew performance should be reviewed by the commander , and 
necessary refresher training conducted. 

Recreation. Off-duty periods also present a good oppurtunity for procedural and operational skill 
maintenance. Video games, music, etc. , all present unique practice opportunities for different 
skills. 

Hardware. A small , portable, videodisk based computer system with voice input and output may 
serve as a generic trainer and recreation device. Such a device will allow practice of skills 
anywhere and anytime on a mission. Different videodisks could be loaded for different procedures, 
and others could be loaded for entertainment. 

A Mars mission presents many challenges far beyond those that have already been approached. 

This oppurtunity should be seized for pushing the state-of-the-art in knowledge of human 
training and skill retention. This paper has stressed the development of empirical models, the 
only unbiased approach to defining training needs. As the research required to achieve these 
models will take many years, now is the time to start. A long duration space mission will require 
an understanding of psychological limitations in all mission phases. This report has stressed the 
need for modeling these limitations in light of training requirements, whether initial or refresher 
training. The proper, scientific method of training definition will require a model of skill 
retetention, as argued here. 
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